Secondary structure of Ac-Alci„-LysH+ polyalanine peptides (n=5,10,15) in vacuo: 

Helical or not? 
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The polyalanine-based peptide series Ac-Alan-LysH^ (n=5-20) is a prime example that a sec- 
ondary structure motif which is well-known from the solution phase (here: helices) can be formed in 
vacuo. We here revisit this conclusion for n=5,10,15, using density- functional theory (van der Waals 
corrected generalized gradient approximation) , and gas-phase infrared vibrational spectroscopy. For 
the longer molecules (n=10,15) a-helical models provide good qualitative agreement (theory vs. ex- 
periment) already in the harmonic approximation. For n=5, the lowest energy conformer is not a 
simple helix, but competes closely with a- helical motifs at 300 K. Close agreement between infrared 
spectra from experiment and ah initio molecular dynamics (including anharmonic effects) supports 
our findings. 



It is often said that the structure of peptides and 
proteins can not be understood without the action 
of a solvent, and this statement is certainly true for 
the full three-dimensional (tertiary) structure of pro- 
teins. However, their secondary structure level (helices, 
sheets, turns) is predominantly shaped by intramolecu- 
lar interactions — most importantly, hydrogen bonds. For 
these interactions, benchmark experiment-theory com- 
parisons under well-defined "clean-room" conditions in 
vacuo can furnish critical information towards a com- 
plete, predictive picture of peptide structure and dy- 
namics. For example, first-principles approaches such as 
density-functional theory (DFT) with popular exchange- 
correlation functionals do not account for van der Waals 
(vdW) interactions. For peptide studies, precise experi- 
mental calibration points for different, actively developed 
theoretical remedies [1-5 would be extremely useful — if 
the same structure as in the solution phase can he formed. 

In a seminal ion-mobility spectrometry study more 
than a decade ago, Hudgins, Ratner, and Jarrold (HRJ) 
[6] reported the formation of just such a secondary struc- 
ture motif known from the solution phase (helical) in 
vacuo for a series of designed, charged polyalanine-based 
peptides Ac-Alan-LysII+ (n=5-20). While much follow- 
up work has been done after the original HRJ study 
(e.g., Refs. [7HT4]) the "helical" nature of the exact se- 
ries Ac-Alan-LysH+ is so far still established only indi- 
rectly by comparing ion-mobility cross-sections to results 
from force-field based molecular dynamics. A helical as- 
signment for polyalanine is thus plausible and easily ac- 
cepted, but different structural conclusions are not en- 
tirely ruled out. This fact is strikingly evidenced by ad- 
ditional ion- mobility experiments with microsolvation [9] , 
which indicate that the secondary structure is not yet he- 
lical for n < 8. On the other hand, a spectroscopic study 
of Ac-Phe-Ala5-LysH+ [12] inferred structures with "he- 
lical" H-bond rings {a- or 3io-helix-like) as the dominant 
conformers. 

The key goal of the present work is to unambiguously 
verify the structure of the n=5, 10, and 15 members of 



the original HRJ series, both experimentally and theo- 
retically. This is an important task, as safely knowing 
the correct structure is a key prerequisite for any further 
physical conclusions. Obviously, this is a broadly impor- 
tant statement for a wide part of physics and chemistry 
— surface science and catalysis, alloys and compounds, 
semiconductor properties etc. — but for a benchmark sys- 
tem such as the HRJ series, such safe knowledge is par- 
ticularly crucial. On the theory side, we employ DFT 
in the PBE [15 generalized gradient approximation cor- 
rected for vdW interactions [5 with an accuracy that is 
critical for the success of our work. We thereby confirm 
the helical assignment for n=10 and 15, but for n=5, 
the lowest energy structure is indeed not a simple he- 
lix. Even for such a relatively short molecule, there is 
then an enormous structural variety to navigate, an ef- 
fort which one must not shun (see below). We verify our 
findings against experimental infrared multiple photon 
dissociation (IRMPD) spectra of the vibrational modes 
in the 1000-2000 cm~^ region, which pertain to finite 
T ^300 K. Here, harmonic free-energy calculations show 
that multiple conformers for n=5 (both helical and non- 
helical) should coexist, and are supported by calculated 
vibrational spectra (harmonic and anharmonic) in close 
agreement with experiment. 

For the experiments, the peptides were synthesized by 
standard Fmoc chemistry. The experimental IR spectra 
were recorded using the Fourier transform ion cyclotron 
(FT-ICR) mass spectrometer [16] at the free-electron 
laser FELIX [17]. Ions were brought into the gas-phase 
by electrospray ionization (ESI) (~ 1 mg of peptide in 
900 fA TFA/100 fA H2O) and mass selected and trapped 
inside the ICR cell which is optically accessible. When 
the IR light is resonant with an IR active vibrational 
mode of the molecule, many photons can be absorbed, 
causing the dissociation of the ion (IRMPD). Mass spec- 
tra are recorded after 4s of IR irradiation. Monitoring 
the depletion of the parent ion signal and/or the frag- 
mentation yield as a function of IR frequency leads to an 
IR spectrum. 



All DFT+vdW calculations for this work were per- 
formed using the FHI-aims [TE\ program package for an 
accurate, all-electron description based on numeric atom- 
centered orbitals. "Tight" computational settings and 
accurate tier 2 basis sets [18 were employed throughout. 
Harmonic vibrational frequencies, intensities and free en- 
ergies were computed from finite differences. For Ac- 
Ala5-LysH+, we computed infrared intensities I{uj) be- 
yond the harmonic approximation from ab initio molec- 
ular dynamics (AIMD) runs >20 ps {NVE ensemble, 
with a 300 K NVT equilibration), by calculating the 
Fourier transform of the dipole auto-correlation function 
[T4J [191 EQ] with a quantum corrector factor to the classi- 
cal line shape [2TJ|22] proportional to cj^ (see Ref. [19 ). 
For a direct comparison to experiment, it is important 
to represent the "density of states" like nature of the 
measured spectra also in the calculated curves. All cal- 
culated spectra (harmonic and anharmonic) are therefore 
convoluted with a Gaussian broadening function with a 
variable width of 1% of the corresponding wave num- 
ber, which accounts for the spectral width of the excita- 
tion laser. Further broadening mechanisms, e.g., due to 
the excitation process, are reflected in the experimental 
data. p3^ . 

For the DFT+vdW calculations on Ac-Ala5-LysH+, 
we generate a large body of possible starting conforma- 
tions using the empirical OPLS-AA force field (as given 
in Ref. [24] and references therein) in a series of basin 
hopping structure searches performed with the TINKER 
[25] package. Our particular choice of force field was not 
motivated by any other reason than that an input struc- 
ture "generator" for DFT was needed. That said, the 
performance of OPLS-AA for gas-phase Alanine dipep- 
tides and tetrapeptides was assessed rather favorably in 
earlier benchmark work. [26l [27] In the searches, specific 
constraints on one or more hydrogen bonds could be en- 
forced. In total, we collected O(IO^) nominally different 
conformers from (i) an unconstrained search, (ii) one hy- 
drogen bond in the Alas part constrained to remain a- 
helical, (iii) two hydrogen bonds in Alas constrained to 
form a 3io-helix, (iv) three hydrogen bonds in the Alas 
part constrained to form a 2^ helix, or (v) one hydrogen 
bond in the full peptide constrained to a 7r-helical form. 
As is well known [28 , conformational energy differences 
between different types of secondary structure may vary 
strongly between different force fields and/or DFT. We 
reduce our reliance on the energy hierarchy provided by 
the force field by following up with full DFT+vdW re- 
laxations for a wide range of conformers, 134 in total. 
This range includes the lowest ~0.3 eV for the uncon- 
strained and Of- helical searches, and the lowest ~0.15 eV 
for the 3io-constrained search, as well as the lowest few 
TT- and 27-helical candidates. Almost all 7r-helical geome- 
tries found in the force-field relaxed with DFT either into 
a or 3 10 helices, and all relaxed 27 helices were higher 
in energy than our lowest-energy conformer by at least 



0.26 eV. 
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FIG. 1: Comparison between experimental [gray full lines in 
(c)] and calculated, broadened harmonic vibrational spectra 
[black, red, and blue solid lines in (c)] for a-helices of Ac- 
Ala5-LysH^(top), Ac-Alaio-LysH^ (middle) and Ac-Alai5- 
LysH^ (bottom). Also shown are the terminating H-bond 
networks used for a (a) and 3io (b). All theoretical results 
for a- and 3io-helices have been aligned to the free C-0 peak 
at - 1 790cm" \ 

Figure [T] shows the experimental IRMPD spectra for 
the three lengths of peptides studied (n=5,10,15) and cal- 
culated vibrational spectra in the harmonic approxima- 
tion for two specific types of hydrogen bond networks: a- 
helical (left) and 3io-helical (right). It is well known that 
both the choice of the density functional and the neglect 
of anharmonic effects will lead to characteristic frequency 
shifts between theoretical and experimental spectra. For 
a better visual comparison, all calculated spectra in the 
present work are therefore rigidly shifted to be aligned 
with the approximate location of the localized free C-0 
vibration at ~ 1790cm~^ in experiment for n=5. For ex- 
ample, this shift amounts to ~20 cm~^ for a-helical con- 
formers. All intensities were uniformly scaled to match 
the highest peak (Amide-I), but no further scaling factors 
(frequency or intensity) were employed. Given the limi- 
tations of the T=0 harmonic approximation when com- 
paring to room-temperature experimental spectra, the 
agreement is rather reasonable for the a-helical conform- 
ers, while this is much less the case in terms of rela- 
tive peak positions and fine structure for the 3io heli- 
cal conformers of n=10, 15. This observation correlates 
with calculated energy differences, where the 3io helical 



conformers are higher in energy than the a-hehcal ones 
by 0.41 eV (n=10) and 0.82 eV (n=15). In addition, 
an OPLS-AA based basin-hopping structure enumeration 
for n=10 did not reveal any non-a conformers within at 
least 0.15 eV. For n=15, the employed structure search 
procedure becomes prohibitive, although direct AIMD 
simulations show that a-helix conformations are struc- 
turally stable even at high T (500 K) for at least several 
tens of ps [29]. The available evidence thus points to 
at least predominantly a-helical secondary structure for 
n=10, 15. In any case, the observed disagreements for 
3io confirm the basic structure sensitivity of the mea- 
sured IRMPD spectra. 




" • 


^ (0 






(d) 


(e) 


g-1 


a-1 


a-2 


3 10-1 


0(Ac) 

0-1 

0-2 
0-3 
0-4 
0-5 


NH-4(a) 
NH-5(7i) 


NH-3(3,o) 
NH-4(a) 


NH-3(3,o) 


NH-3(3io) 


NH-3(27) 


NH-5(a) 


NH-4(3io) 
NH-5(a) 


NH-4(3io) 


NH3- 


NH3* 


NH3+ 


NH-5(3io) 


NHs* 


NH3+ 


NH3+ 


NH3- 


NH3" 


NH-6 


NH3' 


NH3- 


NH-2 


NH3^ 


NHj^ weak 


NH3^ 



FIG. 2: Geometries of Ac-Ala5-LysH+: (a) g-1; (b) a-1; (c) 
a-2; (d) 3io-l; and (e) H-bond network of the conformers: 
(C-)0 and N-H groups are numbered starting from the N 
terminus and ending at the C terminus. 

For Ac-Ala5-LysH+, the four lowest-energy conformers 
from our search and their H-bond networks are shown in 
Fig. [2] Three of these conformers (labeled a-1, a-2, 
3io-l) are "helical", in the sense that they contain two 
well-separated terminations with the appropriate a- or 
3io-like H-bond loops in their Alas section. The lowest- 
energy conformer, labelled g-1, contains only one 2^ like 
loop, H-bonds to the NH3" end of the Lys side chain, and 
one H-bond that runs against the normal helix dipole, ef- 
fectively short-circuiting the terminations. In fact, small 
structural differences in the Lys side chain lead to three 
nonequivalent conformers with the g-1 H-bond network, 
only one of which is shown here for simplicity. 
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FIG. 3: Ac-Alan-LysH^, n = 5,10,15: Calculated empirical 
ion-mobility cross sections and comparison with experiment 
(Ref. [6]). The solid and dashed lines serve only as a guide 
to the eye, and have no physical significance. 



TABLE I: Energy differences of the four chosen Ac- Alas - 
LysH+ conformers wrt. g-1: Pure DFT-PBE (no vdW), DFT- 
PBE+vdW (PES only), and harmonic free energy F at 300 K. 
All energies in eV. 



g-1 a-1 OL-2 


3io-l 


DFT-PBE 0.0 0.04 0.08 


0.04 


DFT-PBE+vdW 0.0 0.09 0.11 


0.19 


F(300 K) 0.0 0.01 0.06 


0.17 



The termination-connecting H-bond of the g-1 con- 
former also leads to an overall volume of the g-1 con- 
former that is somewhat smaller than of the a-1, a-2, 
or 3io-l conformers. This is quantified in Fig. [3] by 
way of computed empirical relative ion-mobility cross 
sections. [30 We show ^^(S^measured — 14.50n A^) as a 
function of peptide chain length n, the same expression 
as used by HRJ [6 . Remarkably, the g-1 conformer for 
n=5 together with a-helical conformers for n=10 and 15 
(dashed line) yields exactly the same qualitative behavior 
as the original data of HRJ. In contrast, our a-1 and a-2 
conformers would yield a much shallower drop towards 
n=5, whereas the 3io-l conformer ends up too high. 

In Table [ij we summarize our computed energy hier- 
archy. In DFT-PBE+vdW, the g-1 conformer is more 
stable than its closest competitors by 0.1-0.2 eV. On this 
scale, vdW interactions are important, as seen by com- 
paring to the pure DFT-PBE energy hierarchy (no vdW) . 
On the other hand, finite temperature effects reduce the 
relative stability of g-1. The calculated harmonic free 
energy of g-1 and a-1 at 300 K is almost equal, and a-2 
is only slightly (~ 60 meV) less stable; only 3io-l stays 
noticeably removed. The expected stability of at least 
three out of the four conformers is thus similar. We note 
in passing that the hierarchy for other DFT junctionals 
(revPBE, or B3LYP at fixed geometry obtained using 
the PBE functional) is qualitatively similar, as long as a 
vdW correction is included. ^31^ 
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FIG. 4: Ac-Alas-LysH^: (a) Theoretical harmonic vibrational spectra (red line) for the four chosen conformers compared with 
experiment (gray line); (b) same for anharmonic spectra from AIMD trajectories. All theoretical spectra are shifted rigidly so 
that the free CO peak (^ 1790cm~^) is aligned. 



Finally, Figj4] shows computed vibrational spectra for 
all four conformers compared to experiment. Again, we 
align the localized free C-0 vibration peak to experiment 
by a rigid shift and scale the intensities to match the 
height of the Amide-I peak. No further scaling factors 
are employed. In panel (a), spectra calculated in the 
harmonic approximation are shown. While there is an 
overall qualitative similarity of measured and computed 
spectra for all four conformers, it is also clear that none 
of them fit entirely — peak shifts and incorrect relative 
intensities (especially those obscuring the gap between 
Amide-I and -II) abound. This situation changes when 
spectra computed from ab initio molecular dynamics and 
the dipole-dipole autocorrelation are considered [panel 
(b)]. For g-1, a-1, and a-2, nearly perfect matches to ex- 
periment are obtained: The relative positions of Amide-I 
and Amide-II are almost exact, the interfering peaks in 
the gap decrease in intensity, and even the fine structure 
towards lower wave numbers is well reproduced. As one 
example, consider the experimental peak at ^1250 cm~^ 
compared to the g-1 conformer. It coincides with a mini- 
mum of the harmonic spectrum, while a theoretical peak 
lies much closer in the anharmonic case. Consistent with 
the free energy, it is thus possible and plausible that all 
three conformers contribute to the experimentally ob- 
served signal from the ions, which were held in the ion 
trap at room temperature. In contrast, the features of 
the theoretical 3io-l conformer do not match quite as 
well. If 3io-l is present in the ion trap at all, then cer- 
tainly with a much smaller fraction than g-1, a-1, and 
a-2. 

In summary, we demonstrate a quantitative structure 
prediction for Ac-Alan-LysH+ (n=5,10,15), with strong 
support by the good agreement between calculated and 
measured vibrational spectra. Our calculations provide 



a direct confirmation for the proposed a-helical nature of 
Ac-Alan-LysH+ (n=10,15), while the lowest energy con- 
former of the "classic" HRJ series, n=5, is indeed not a 
simple helix. Importantly, finite-temperature free-energy 
effects still render a-helical n=5 conformers possible and 
even rather probable in an experimental molecular beam. 
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